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lim lissiiig of claims will repiaee all ptieir \?srsi<ms^ ai^d UMings, <>f claims iii the 
application: 

i. -currciiti.^ ^^r;KrjJk;a) A method Ibr automatically indexing and reuieving a mukimedia 
.eveiu.. -comprising: 

separating a muUhnedia data streaii) into audio, visual and text components; 

segi-nemmg the audio, visuiU and text components of the multimedia data stream based on 
seniiiuuc di-ibrt;ni.x\N, \xhcrcir !}a.n:t>ievci features are extracted from the segtnented audio 
eoKiponcnt as-e irs aphir<iiitY ofoubbarKis; 

identifying at least one X&tg&i speakei- using the audio a^rd visual components; 

ideBtifyiiig semantic boundaries of text for at least one of die Identified tm'get speakers to 
generate semaniicany coherent text blocks; 

generating a summar>' of multimedia content based <m ibc sudio. ■"s'isual and text 
components, tiie semanlically coherent text blocks and the identified targei sptuiker; 

deriving a topic for each of the semantically coherent text blocks h^ed on a set of topic 
category models; and 

i;on-ora:ii''S ;5 raulihnect.i Jescription of the Tni-ltin-cdij c\C!-i ba^-u-d on ihc idcnuned 
target speaker, ih-.- scmami.:all> coherent text blocks, ilv iilentifl^i lopic, and ihc gcm/raied 
smnmMf, 

?. V- 'rsT"- ' .^o -'Jc t}>oJ etain^ 1, lunher comprising; 

arit.nnaui.ar!> tUcnusN.jig a *«erarcb} of nusUjnK'JKi content lypes. 
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3 , (Or j giiiai) fhe msthod of claim 2. wherem rayltimedia coxiient types iaeliide nt least one 
oi^peakers. auclKH'f.. }nter\ie\vs, correspondence reports, multimedia cositent fsognkuit^, genera! 
nt'xs's .su>ncs, tofical news stories, nesvs summaries, md commt.'rcial$. 

■1 ^Original) I'he metbod of claim 1 , funhcr comprising; 

sion-i ershig the multin^t'dia data streaiii from aii anaiog multimedia data stream te a digital 
nv.j;t;;rcJ .4 o;'Jj s; e..'i' :.-r! 

Osx^ipicv ^-^ij -'^x^ J-xr , ^ur/rmcuia data stream. 

5, (OrigmaD I ne niets: > ' cLn r I . wha-em the extracicd audio feature.- from t'm: audio 
cor.poneni fuilho'Tosnrr'-o d": lo^ oi features. 

6 >:">5r\f ) " no :r enod Ox~ c^anrj ; . v\ herein Urn niultt Jiedfa es ^ m mclisce^ ^ .irv^uica^t 
and ihc target ispeakers iiiclude news anchorpersons, 

7, a>.i,cirar' The method of elalni I . wlierehi the step of ident--} ir^g ut iea:st one .-peaker 
Includes the proees:* of Identifying using Oaus^sian MKUire Models, 

^. iOngiaal) Fhe method ofclaim I, wherein the generated multimedia description is 
ropre^enk'd b>- at least one of a text description, a vlder description snd a sior}- icon. 

9. iOrigitiai S The method of claim 1 . further eomprising: 

j;soriug the generated muhlmedia descriptions in a database. 
I-\ Tv ^;hod of claim 1, further conipnshg, 

presenting me generated multimedia description to a user. 
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multimedia description to the user. 

12. (OdgiMl) Tlfe method of claim 1, wherem the plurality of sultods c«m|)nses three 
mhbmib. 

: 1 C >i'ginal) \ M. method ot clxn 12. \\ herein the ftamt. k%e xcaturcji, m turcv su^ba?ld^ 
^ ' 4-5 ^ crossing -ate, pjlch pcrod iTcquciKn ^.cntioia, *requenc\ 
J r t'.i, vVorA) rt%o* 

event, compri sing: 

a nultinedia ddfj 'Urcam sopuntkin unn that separates a mii'timedid d&U. ^^!.L&tn into 
audio. Yusijsl afidtext eomponems; 

0 a. eo ^ 1.0 .ii-'unv;}!* segmentation imit that scgrT>ont«> th^' <iudio. s'i:sml oxid text 
vtr^ro \> V ' ^ raljTCviM daU stream based on ft.enanrc diilcreticc^; 

: ic eNi. action \xm: that extract's auJjo ft<itures fivm ihc aud:o cor aNu\ f\ oad i k 
aui'o fcatiyc^ Ct^nmnjihg a fmmt.-lc\Cx *eat«re in a plurahts oi sobhtwids, 

a tao^gci srsejLei detecti{>n unit Ih^it identiJiei> at least one target s.peaker m>^xig. ihe audio 
and.visaai comDonenes; 

a CO ito-f^ -^v.^ -Sv. ra..i> . a i ' hdt ideistifiet^ seai.^ i jl ho i c o o ' \ v."»-l o'k^ oi 

iH jJe \^'t1vd tarjje* i-pcalvCis?, to goicnnc scnianiicalh coherent texvb'ocks, 

a sunniar^ gcneraiv i tnat generates a summai> ot mulUmcdla u->:itent ba^ed on tlK audio^ 
\ '< of\ >„\ ^ t\-' L ^P! t. > h ooherennt'xi Nocks fjjiil i> !> v\ h< \> 

atopic eaiegorissatiooanii that derives a topic semaisiiealiy eoliereni text 
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bloskp; based o» & set of topk category jBodeis; and 

xH •'cduj o%er t ivso^t on tLt identukd taiget socaU , the sc;'ra .\cA ^ ^ohctc t ex. •'fcKk^ 
K 4&fh-ti i:4 K '^d the ac ic^i.tovj ^ im.T\ars 

vJo-^M V i '\ idcr iiie*? a hieraic^n <»f mu!t media content t> pes 

sev-^ s\'i 5^"^ V, v-^-s, ^ r..v\v>.onnMi es a? J 1,!. ui rofv'a N 

V I im >>-^^ tc-digitai coii-i ertvt th^st co ive 1\ U e niulumodia aata sv^&m .rom a ^ analog 
^li, nieau dai<.stj\an to ad gtal ^vikimediadatastieaiTi a^d 

co^pscNt-^or nnn ihj. comprc«i>Ci, the digital multupedia data sinjan 

18, (Orlgma!) The system of elsim 14, wherein the multimedia even! includes a news broadcast 
ard tazgei ?f iaierj* i:^dude ncw> a. cl yrpersons. 

\mo^> ' t\a\r 14. whert-5ji hci<fi^^jt vpoaker detection mitidcntnlcs at 
v^iM s.>ai. taxg^v .*pcaf.*j» j>;r.y (Tau$s:an Vli\ture Mode>i> 

2f <^(*?'g?rah r\ s>skn? of c ' ^ • . n ^ ^ < ^s, ^ ^ <tit". ^'- 

v'>dec Oxj-sc'^'ftjon .vx^ -tOx-> Icon 

7 



ApplicaDcjivCosiiroi Nasj^ber: 10/ft86.459 Docket M«.: nSMA-Con-l 

2L ((MgiaaH The system ufckim 14. further comprising: 

8 database that stores the generated muftimedia descriptions. 

i.:, i ?.)ridn;^r: Tbe bvstem of claim J4, wherein the gcneraied muiiimedla descriptions are 
retfieved from the database and presented to a user. 

23. (Qrigfea^) 'I he systerJi of claim 223mliers;omprism^ 

a playback device tl>at plays bsck the segment of the muhhnedia cx eni ^orrcjsrxn-ditig to 
tlie generated mtiMmedia description to the user. 

24. (drigjMl) lite sy» »f d&m 14, wtein the pliiraltty of SisMjajids corapriseg5 three; 
snhhmis. 

25. (Odginal) A tsmim^ tiM displays the multimedia descriptions genemted^^ multimedia 
descoptloTt generator of claim L 



